Markov decision process - PDFSEARCH.IO - Document Search Engine

Markov decision process
Results: 537

#	Item
21	On Structural Properties of MDPs that Bound Loss due to Shallow Planning 1 Nan Jiang1 and Satinder Singh1 and Ambuj Tewari2 Computer Science and Engineering, University of Michigan 2 Add to Reading List Source URL: dept.stat.lsa.umich.edu Language: English - Date: 2016-04-20 13:16:34 Field theory Mathematics Algebraic geometry Valuation Markov decision process Probability theory Logic Natural deduction
22	Journal of Artificial Intelligence Research–1178 Submitted 12/15; publishedExploiting Causality for Selective Belief Filtering in Dynamic Bayesian Networks Add to Reading List Source URL: jair.org Language: English - Date: 2016-04-28 15:06:13 Statistics Graphical models Probability Estimation theory Robot control Dynamic Bayesian network Probability theory Bayesian network Causality Particle filter Partially observable Markov decision process Belief propagation
23	Sequential Decision Making in Repeated Coalition Formation under Uncertainty Georgios Chalkiadakis Craig Boutilier Add to Reading List Source URL: www.intelligence.tuc.gr Language: English - Date: 2008-02-08 15:14:59 Statistics Game theory Statistical inference Dynamic programming Markov processes Stochastic control Probability theory Partially observable Markov decision process Coalition Cooperative game theory Core Bayesian game
24	Bias in Natural Actor-Critic Algorithms Philip S. Thomas Department of Computer Science, University of Massachusetts, Amherst, MAUSA Add to Reading List Source URL: psthomas.com Language: English - Date: 2012-10-01 18:27:53 Statistics Statistical theory Estimation theory Dynamic programming Markov decision process Stochastic control Bias of an estimator Reinforcement learning Loss function Fisher information
25	Verification of Markov Decision Processes using Learning Algorithms? Tom´asˇ Br´azdil1 , Krishnendu Chatterjee2 , Martin Chmel´ık2 , Vojtˇech Forejt3 , Jan Kˇret´ınsk´y2 , Marta Kwiatkowska3 , David Parker4 , a Add to Reading List Source URL: www.hieratic.eu Language: English Computational complexity theory Theory of computation Dynamic programming Markov decision process Stochastic control Analysis of algorithms Mathematical logic Reinforcement learning Time complexity Algorithm PP
26	Online Development of Assistive Robot Behaviors for Collaborative Manipulation and Human-Robot Teamwork Bradley Hayes and Brian Scassellati Dept. of Computer Science, Yale University Human-robot teaming has the potential Add to Reading List Source URL: bradhayes.info Language: English - Date: 2016-07-11 15:51:46 Dynamic programming Partially observable Markov decision process Stochastic control Hierarchical task network Robotics Robot
27	Language Understanding for Text-based Games using Deep Reinforcement Learning Karthik Narasimhan∗ CSAIL, MIT Add to Reading List Source URL: arxiv.org Language: English - Date: 2015-09-14 21:25:04 Artificial neural networks Computational neuroscience Cybernetics Q-learning Recurrent neural network Reinforcement learning Long short-term memory DQN Markov decision process Sepp Hochreiter Artificial intelligence Quest
28	Coordination in Multiagent Reinforcement Learning: A Bayesian Approach Georgios Chalkiadakis Craig Boutilier Add to Reading List Source URL: www.intelligence.tuc.gr Language: English - Date: 2009-03-02 16:24:03 Game theory Reinforcement learning Nash equilibrium Q-learning Strategy Partially observable Markov decision process Action selection Best response Bellman equation Zero-sum game Agent-based model Solution concept
29	Sutton, Richard PIN Add to Reading List Source URL: webdocs.cs.ualberta.ca Language: English - Date: 2013-10-18 16:05:54 Computational neuroscience Belief revision Reinforcement learning Computational statistics Q-learning Temporal difference learning Artificial neural network Machine learning Markov decision process Mathematical optimization Algorithm Gradient descent
30	Learning from Demonstrations: Is It Worth Estimating a Reward Function? Bilal Piot1,2 , Matthieu Geist1 , Olivier Pietquin1,2 1 Supélec, IMS-MaLIS Research group, France Add to Reading List Source URL: www.ilhaire.eu Language: English - Date: 2013-10-03 05:33:46 Operations research Dynamic programming Stochastic control Markov processes Reinforcement learning Markov decision process Valuation Algorithm Mathematical optimization